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AGING OF THE METROPOLIS DYNAMICS ON THE RANDOM 

ENERGY MODEL 

Jim CERNY AND TOBIAS WASSMER 


Abstract. We study the Metropolis dynamics of the simplest mean-field spin glass 
model, the Random Energy Model. We show that this dynamics exhibits aging by show¬ 
ing that the properly rescaled time change process between the Metropolis dynamics and 
a suitably chosen ‘fast’ Markov chain converges in distribution to a stable subordinator. 
The rescaling might depend on the realization of the environment, but we show that its 
exponential growth rate is deterministic. 


1. Introduction 

This paper studies the out-of-equilibrium behavior of the Metropolis dynamics on the 
Random Energy Model (REM). Our main goal is to answer one of the remaining important 
open questions in the field, namely whether this dynamics exhibits aging, and, if yes, 
whether its aging behavior admits the usual description in terms of stable Levy processes. 

Aging is one of the main features appearing in the long-time behavior of complex disor¬ 
dered systems (see e.g. |BCKM97] for a review). It was for the first time observed experi¬ 
mentally in the anomalous relaxation patterns of the residual magnetization of spin glasses 
(e.g. jLSNB83l ICha84j ). One of the most influential steps in the theoretical modeling of the 
aging phenomenon is the introduction of the so-called trap models by Bouchaud |Bon92] 
and Bouchaud and Dean |BD95j . These models, while being sufficiently simple to allow 
analytical treatment, reproduce the characteristic power law decay seen experimentally. 

Since then a considerable effort has been made in putting the predictions obtained from 
the trap models to a solid basis, that is to derive these predictions from an underlying 
spin-glass dynamics. The first attempt in this direction was made in |BBG02[ IBBG03al 
IBBGn3b] where it was shown that, for a very particular Glauber-type dynamics, at time 
scales very close to the equilibration, a well chosen two-point correlation function converges 
to that given by B ouchau d’s trap model. 

With the paper jBGn8 , where the same type of dynamics was studied in a more general 
framework and on a broader range of time scales, it emerged that aging establishes itself 
by the fact that scaling limits of certain additive functionals of Markov chains are stable 
Levy processes, and that the convergence of the two-point correlation functions is just a 
manifestation of the classical arcsine law for stable subordinators. 

The Glauber-type dynamics used in those papers, sometimes called random hopping 
time (RHT) dynamics, is however rather simple and is often considered as ‘non-realistic’, 
mainly because its transition rates do not take into account the energy of the target state. 
Its advantage is that it can be expressed as a time change of a simple random walk on 
the configuration space of the spin glass, which allows for a certain decoupling of the 
randomness of the dynamics from the randomness of the Hamiltonian of the spin glass, 
making its rigorous studies more tractable. 
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For more realistic Glauber-type dynamics of spin glasses, like the so-called Bouchaud’s 
asymmetric dynamics or the Metropolis dynamics, such decoupling is not possible. As a 
consequence, these dynamics are far less understood. 

Recently, some progress has been achieved in the context of the simplest mean-held 
spin glass model, the REM. First, in |MM15j . the Bouchaud’s asymmetric dynamics have 
been considered in the regime where the asymmetry parameter tends to zero with the size 
of the system. Building on the techniques started in |Moullj . this papers conhrms the 
predictions of Bouchaud’s trap model in this regime. Second, the Metropolis dynamics 
have been stndied in |Gayl4| , for a truncated version of the REM, using the techniqnes 
developed for the symmetric dynamics in |Gayl2 GaylOj , again conhrming Bouchand’s 
predictions. 

The weak asymmetry assumption of |MM15j and the truncation of |Gayl4| have both 
the same purpose. They aim at overcoming some specihc features of the asymmetry and 
recovering certain features of symmetric dynamics. Our aim in this work is to get rid of 
this simplihcations and treat the non-modihed REM with the usnal Metropolis dynamics. 

Let us also mention that Bonchaud’s asymmetric dynamics (and implicitl y the Metrop¬ 
olis one) is rather well understood in the context of trap models on Z'’*, see IbGIII IGerll 


GSI 3 ] . where it is possible to exploit the connections to the random conductance model 


with unbounded conductances, |BD10j . Finally, the Metropolis dynamics on the complete 
graph was considered in |Gayl2|. 


Before stating our main result, let us briefly recall the general scheme for proving aging 
in terms of convergence to stable Levy processes. The actual spin glass dynamics, X = 
{Xt)t>o, which is reversible with respect to the Gibbs measure of the Hamiltonian, is 
compared to another Markov chain Y = (Ft)t>o on the same space, which is an ‘accelerated’ 
version of X and whose stationary measure is uniform. The process Y is typically easier 
to be understood, e.g. it is a simple random walk for the RHT dynamics, and the original 
process X can be written as its time change, 

Xit) = YiS-\t)), ( 1 . 1 ) 

for the right continnous inverse S~^ of a certain additive functional S of the Markov chain 
Y, called the ‘clock process’. The aim is then to show convergence of the properly rescaled 
clock process S to an increasing stable Levy process, that is to a stable subordinator. 


We now state our main result. We consider the unmodihed REM, as introduced in 
|Der801 IDerSlj . The state space of this model is the Wdimensional hypercube Hjv = 
{ —1,1}^, and its Hamiltonian is a collection {E^)x£mN of i-i.d. standard Gaussian random 
variables dehned on some probability space (G, P). The non-normalized Gibbs measure 

Tx = at inverse temperature /3 > 0 gives the equilibrium distribution of the system. 

The Metropolis dynamics on the REM is the continuous-time Markov chain X = (Xj)f>o 
on EItv with transition rates 

r^y = (^1 x,yeMN. ( 1 . 2 ) 

Here, x ^ y means that x and y are neighbors on EItv, that is they differ in exactly one 
coordinate. 

As explained above, we will compare the Metropolis chain X with another ‘fast’ Markov 
chain Y = {Yt)t>o with transition rates 

Tx A Ty 

1 A _ 


1,9 e Hm. 


(1.3) 
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It can be easily checked using the detailed balance conditions that Y is reversible and that 
its equilibrium distribution is 

1 Lr ^ TUT 

Px = —, X G Mat, 

Zn 

where Finally, since r^y = (1 V Tx)~^qxy, can be written as a time 

change of Y as in fll.ip with the clock process S being given by 

S{t) = [ (iVryJds. (1.4) 

Jo 

For the rest of the paper we only deal with the process Y and the clock process S, the actual 
Metropolis dynamics X does not appear anymore after this point. For a hxed environment 
T = (Tx)xeH]v) let PJ" denote the law of the process Y started from its stationary distribution 
z/, and let P([0,T],M) be the space of R-valued cadlag functions on [0,T]. We denote by 
(Jc = \/2 log 2 the (static) critical temperature of the REM. Our main result is the following. 


Theorem 1.1. Let a G (0,1) and > 0 be such that 


1 


(1.5) 


and define 

gN = {afiV2TrN)~^. ( 1 . 6 ) 

Then there are random variables Rn which depend on the environment {Ex)xeUN only, such 
that for every T > 0 the rescaled clock processes 

SM{t)=gfij^S{tR^), te[0,T], 


converge in ^-probability as N ^ oo, in Pf -distribution on the space P([0,T],M) equipped 
with the Skorohod Mi-topology, to an a-stable subordinator V^. The random variables Rjsf 
satisfy 


lim 

N^OO 


log Rn 
N 


2 


P-a.s. 


(1.7) 


Let us make a few remarks on this result. 

1. The result of Theorem o conhrms that the predictions of Bouchaud’s trap model 
hold for the Metropolis dynamics on the REM, at least at the level of scaling limits of 
clock process es. It a lso compares directly to the results obtained for the symmetric (RHT) 
dynamics in |BCn8j . The scales gN and Rn are (up to sub-exponential prefactors) the 
same a s previously, including the condition 01.51) or the range of parameters a,/?. As in 
BCOS] , the right inequality in 01.5p is completely natural, beyond it Y ‘feels’ the hniteness 
of Hat and aging is not expected to occur. The left inequality in 01.5p is technical, it 
ensures that the relevant deep traps are well separated (cf. Lemma [2Tp . introducing certain 
simplihcations in the proof. We believe that this bound might be improved to /fil > 0, 
by further exploiting our method. Finally, as previously, note that 01.5p is satished also 
for (3 < fic for appropriate a, hence aging can occur above the critical temperature. 


2. O ur choi ce of the fast chain Y is rather unusual. In view of the previous papers 
it would be natural to take instead the ‘uniform chain’ Y with transition 
that is without the correction 1 A Tj, which appears in 01.31) . This chain is 


|MM151 iBCll 


rates Tx A r, 


S/! 


reversible with respect to the uniform distribution on Hat. This choice has, however, some 
dehciencies. On the heuristic level, Y is not an acceleration of X, since it is much slower 
than X on sites with very small Gibbs measure Xj, -C 1. These sites, which are irrelevant 
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for the statics, then ‘act as traps’ on Y, making them relevant for the dynamics, which is 
undesirable. On the technical level, the trapping on sites with small Gibbs measure has 
the consequence that the mixing time of Y is very large. 

Our choice of the fast chain Y runs as fast as X on the sites with small Gibbs measure 
and thus does not have this dehciency. Moreover, since whenever > 0, the 

equilibrium distribution of the fast chain Y is still uniform on the relevant deep traps, so 
the clock process S retains its usual importance for aging. 

Remark also that in order to overcome the similar difficulties, |MM15] truncate the 
Hamiltonian of the REM at 0 which effectively sets > 1 for all x G We prefer to 
retain the full REM and use the modihed fast chain Y instead. Finally, |Gayl4| uses the 
discrete skeleton of X as the base chain, which has some interesting features but introduces 
similar undesirable effects. 

3. We view Theorem o as an aging statement, without further considering any two- 
point correlation functions. Actually, it seems hard to derive aging statements for the 
usual correlation functions from our result without extending the paper considerably. Such 
derivation usually requires some knowledge of the fast chain Y that goes over the Mi- 
convergence of the clock processes. This knowledge is typically automatically obtained in 
the previous approaches. The strength (or the weakness) of our method is that we do not 
need to obtain such hner knowledge to show the clock process convergence. 

4. A rather unusual feature of Theorem II.II is the fact that the scaling Rjy is random, it 
depends on the random environment. This again a consequence of our technique. Glaim 
fll.71) in Theorem 11.11 however shows that at least the exponential growth of Rn is deter¬ 
ministic. The random scale R^ is explicitly dehned in 02 .101) . We will see that its dehnition 
depends on a somewhat free choice of an auxiliary parameter, but nevertheless the hnal 
result does not depend on this parameter. Not only this property makes us conjecture that 
Rn should actually satisfy a deterministic law of large numbers, 

lim = 1, P-a.s., 


for some function h{N) growing at most sub-exponentially. 

5. The mode of convergence in Theorem 11.11 is not optimal, one would rather like 
to obtain the convergence in PJ'-distribution for P-almost every environment, which is 
usually called ‘quenched’ convergence. Actually, Theorem 1 1.1 1 can be strengthened slightly 
to a statement which is somewhere between P-a.s. convergence and convergence in P- 
probability. Namely, the statement holds for a.e. realization of sites with ‘small’ Tx, but 
only in probability over sites with ’large’ Tx, cf. Remark 16.41 

6 . Our proof of Theorem 11.11 strongly exploits the i.i.d. structure of the Hamiltonian of 
the REM. At present we do not know if it is possibl e to com bine our techniques with those 
used for the RHT dynamics of the p-spin model in |BBGn8 . IBG12] . 

We proceed by commenting on the proof of Theorem II.11 concentrating mainly on its 
novelties. The general strategy so far to prove such a result has been to hrst reduce the 
problem to the clock process restricted to a set of deep traps which govern the behavior of 
the original clock process. The different methods then all more or less aim at dividing the 
co ntribut ion of cons equently found deep traps into essentially i.i.d. blocks. For example 
or [BGllj . this is achieved by controlling the hitting probabilities of deep traps. 


m 


BGn8 


proving that they are hit essentially uniformly in exponentially distributed times, and 
controlling the time the chain spends at the deep traps by a sharp control of the Green 
function. Similar rather precise estimates on hitting probabilities and/or Green function are 
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necessary in other approaches. Using this i.i.d. strnctnre, one can then show convergence 
of the clock process by standard methods, e.g. compnting the Laplace transform. 

The meth od used in this paper is slightly inspired by the general approach taken in 
|FM14] and CW15 . There, models of trapped random walks on 1/ are considered where 


few information about the discrete skeleton as well as the waiting times of a continuous-time 
Markov chain are available, and minimal necessary conditions for convergence of the clock 
process are found. Taking up this idea, instead of analyzing in detail the behavior of the 
fast chain U, we extract the minimal amount of information needed to show convergence of 
the clock process. In particular, we do not need any exact control of hitting probabilities 
and Green functions of deep traps, as most previous work did. 

The hrst step in our proof is standard, namely that the main contribution to the clock 
process comes from a small set of vertices with large Gibbs measure Ta,, the so-called deep 
traps, and that in fact the clock process of the deep traps converges to a stable subordinator. 
Denote the set of deep traps hy Vjq (see Section |2] for details). We will show that the clock 
process S can be well approximated by the ‘clock process of the deep traps’ 



Then it remains to show that in fact g^^SvitEw) converges to a stable subordinator. 

To this end, we will in some sense invert the standard procedure described above. Instead 
of approximating the clock process by an i.i.d. block structure and then use the Laplace 
transform to show convergence, we will hrst compute a certain conditional Laplace trans¬ 
form using some special properties of the Metropolis dynamics. Then we analyze what is 
actually needed in order to show convergence of the unconditional Laplace transform. 

A bit more detailed, this will be done as follows. Under condition fll.Sp . the deep traps 
are almost surely well separated. This fact and the fact that the dehnition fll.3p contains 
the factor Tx A Ty imply that the transition rates Qxy of the fast chain Y do not depend on 
the energies Ex of the deep traps, but only on their location. Therefore, one can condition 
on the location of all traps and the energies Ex of the non-deep traps, which determines 
the law PJ of Y, and take the expectation over the energies of the deep traps. We call this 
a ‘quasi-annealed’ expectation, and denote it by for the moment. Let it{x) denote the 
local time of the fast chain Y (see Section [2] for details). As is simply an expectation 
over i.i.d. random variables, the quasi-annealed Laplace transform of the rescaled clock 
process of the deep traps given Y can be computed. It essentially behaves like 


Ei, I F] ^ exp I - [• (1-9) 

^ xeT>M ^ 

Here, is a deterministic sequence tending to 0 as iV —)■ cx). The above approximation 
shows that the only object related to Y we have to control is the local-time functional 

We will show that this a priori non-additive functional of Y actually behaves in an 
additive way, namely that it converges to f as —?> cxd, under PJ" for P-a.e. environment r. 
For this convergence to hold it is sufficient to have some weak bounds on the mean hitting 
time of deep traps as well as some control on the mixing of the chain Y together with an 
appropriate choice of the scale Pat that depends on the environment. 

Using standard methods we then strengthen the quasi-annealed convergence to quenched 
convergence (in the sense of Theorem ll.il) . 
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To conclude the introduction, let us comment on how our method might be extended. 
The key argument in the computation of the quasi-annealed Laplace transform, namely 
the fact that the chain V is independent of the depth of the deep traps, seems very specihc 
for the Metropolis dynamics. However, by adapting the method appropriately and using 
network reduction techniques, we believe that one could also treat Bouchaud’s asymmetric 
dynamics and Metropolis dynamics in the regime where the left-hand side inequality of 
fll.Sp fails, i.e. there are neighboring deep traps. 

The rest of the paper is structured as follows. Detailed dehnitions and notations used 
through the paper are introduced in Section [2l In Section |3] we analyze the mixing proper¬ 
ties of the fast chain V, which will be crucial at several points later. In Section 0] we give 
bounds on the mean hitting time of deep traps and on the normalizing scale Rjv- Using 
these bounds and the results on the mixing of U, we show concentration of the local time 
functional Section O We prove convergence of the rescaled clock 

process of the deep traps in Section 0] with the above mentioned computation of the quasi- 
annealed Laplace transform, using the concentration of the local time functional. Finally, 
we treat the shallow traps in Section [7] by showing that their contribution to the clock 
process can be neglected. In Appendix |A] we give the proof of a technical result which is 
used to bound the expected hitting times in Section 01 

2. Definitions and notation 

In this section we introduce some notation used through the paper and recall a few 
useful facts. We use Mn to denote the A^-dimensional hypercube { — 1,1}^ equipped with 
the usual distance 

1 ^ 

d{x,y) = - 

i=l 

and write Sn for the set of nearest-neighbor edges £n = {{x,y} : d{x,y) = 1 }. 

For given parameters a and f3, let 

7 = ( 1 / 2 , 1 ), ( 2 . 1 ) 

by condition fll.5p in Theorem ll.il 

Recall from the introduction that {E^ : x G IInjN > 1), is a family of i.i.d. standard 
Gaussian random variables dehned on some probability space (D, P). Note that we do 

not denote the dependence on N explicitly, but we assume that the space (D, E, P) is the 
same for all N. For /3 > 0 the non-normalized Gibbs factor Tx is given by Tx = 

Using the standard Gaussian tail approximation, 

f’[Ex >t] = —-= -|- o(l)) as f oo, (2.2) 

tv 27r 

we obtain that as dehned in Theorem ll.il satishes 

W[tx > ugjsi] = + o(l)). 

This heuristically important computation explains the appearance of stable laws in the 
distribution of sums of Tx'- If we observe 2'*'^ vertices, then hnitely many of them have 
their rescaled Gibbs measures Tx/gN of order unity, and, moreover, those rescaled Gibbs 
measures behave like random variables in the domain of attraction of an a-stable law. 

Recall also that Y = (F/)t>o stands for the fast Markov chain whose transition rates qxy 
are given in fll.dp . and that i> = {i'x)x&In denotes the invariant distribution of this chain. 
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Vx = For a given environment r = (ra;) 3 ;eH^, let PJ and PJ denote the laws of Y 

started from a vertex x or from u respectively, and El the corresponding expectations. 

Note that the normalization factor Zj^ = X]a;eHjv(^ ^ satishes, for every constant 

A e (0,1/2), 

< 2^ P-a.s for N large enough. (2.3) 


Indeed, obviously Zj^ < 2^, and Z^- > X^xsHjv l{Fa:>o} are i.i.d. Bernoulli 

random variables, therefore the statement follows immediately by the law of large numbers. 
An important role in the study of properties of Y is played by the conductances dehned 


by 

Tx/\Ty 

Cxy = ^xQxy = - for X ^y. 

Zjn 

Let Og be the left shift on the space of trajectories of Y, that is 


(2.4) 


(»T)i = n+<. 


(2.5) 


Let Hx = infjf > 0 : F) = x} be the hitting time of x by Y, Ji the time of the hrst jump 
of Y, and let = Hx o 6j^ + Ji = inf{t > A : 1/ = x} be the return time to x by Y. 
Similarly dehne Ha and H\ for a set A C Hjv. The local time of Y is given by 

^t{x) = / l{Ys=x}ds. 

Jo 

Using this notation the clock process S introduced in 01.41) can be written as 

S{t) = j (lVryJds= ^ it{x){iyTx). 

xeHjv 


To dehne the set of deep traps and the random scale Rn mentioned in the introduc¬ 
tion we introduce a few additional parameters. For a G (0,1), /? > 0 as in Theorem 11.11 
and 7 as in 02 .ip . we hx 7 ' and a' such that 


1 

2 


< 7 ' < 7 , 


and a' 



( 2 . 6 ) 


An explicit choice of 7 ' will be made later in Section |3 We dehne the auxiliary scale 


and set 

Pat = {x e Htv : Tx > g^}. 

to be the set of deep traps. By the Gaussian tail approximation 02.2p it follows that the 
density of P^v satishes 

P[a:Gp 7 v] = 2-^''^(l + o(l)). (2.7) 

We quote the following observation on the size and sparseness of Pat. The sparseness will 
play a key role in our computation of the quasi-annealed Laplace transform in Section [ 6 l 


Lemma 2.1. 


Bcns 


Lemma 3.7] For every £ > 0, P-a.s. for N large enough, 


e (1 -£,! + £). 


( 2 . 8 ) 


Moreover, since 7 ' > 1/2, there exists <5 > 0 such that P-a.s. for N large enough, the 
separation event 

y'= {mm{d{x,y) : x,yeVN}>dN} (2.9) 

holds. 
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Finally, for the sake of concreteness, let us give the explicit form of the random scale 

Rn, 




Ei[tT.jxr] 

El[HA 


-1 


( 2 . 10 ) 


where T^ix denotes the mixing time of K, a randomized stopping time which we will 
construct in Section [31 The reason for this dehnition will become apparent when we prove 
the concentration of the local time functional mentioned in the introduction. Although 
the dehnition oi Rn seems arbitrary by the somewhat free choice of the parameter 7 ', 
Theorem 11.11 actually shows that asymptotically Rn will be independent of 7 '. 


For the rest of the paper, c, c', c" will always denote positive constants whose values may 
change from line to line. We will use the notation g = o(l) for a function g{N) that tends 
to 0 as —)■ cxo, and g = 0{f) for a function g{N) that is asymptotically at most of order 
f{N), i.e. liniTv^oo |5'(^)l//(^) < c, for some c> 0. 


3. Mixing properties of the fast chain 


The fact that the chain Y mixes fast, namely on a scale polynomial in N, plays a crucial 
role in many of our arguments. In this section we analyze the mixing behavior of Y. We 
hrst give a lower bound on the spectral gap Ay of Y, which we then use to construct a 
strong stationary time T^ix. 


Proposition 3.1. There are constants k > 0, K > 0, Cq > 0, such that P-a.s. for N large 
enough, 


Ay > 


We prove this proposition with help of the Poincare inequality derived in |DS91j . To 
state this inequality, let F be a complete set of self-avoiding nearest-neighbor paths on Hjv, 
that is for each x y E Hat there is exactly one path £ T connecting x and y. Let I 7 I 
be the length of the path 7 . By Proposition 1’ of [DS91] . using also the reversibility of Y 
and recalling the dehnition fl2.4p of the conductances, it follows that 


1 

-— < max 

Ay e={u,v}GSN 





(3.1) 


To minimize the right-hand side of fl3.ip . a special care should be taken of the edges 
whose conductance Cuv = {tu A t^/Zn is very small, that is which are incident to vertices 
with very small Tu- Those ‘bad’ edges should be avoided if possible by paths 7 G P. They 
cannot be avoided completely, since P should be a complete set of paths. On the other 
hand, if such edge is the hrst or the last edge of some path its small conductance is 
canceled by equally small or Vy. Therefore, to apply fl3.ip efficiently, one should hnd a 
set of paths P such that all paths 7 G P avoid ‘bad’ vertices, except for vertices at both 
ends of the paths. 

In the context of spin glass dynamics this method was used before in |FIKP98] to hnd 
the spectral gap of the Metropolis dynamics fll.2p . Using the same approach, that is using 
the same set of paths P as in |FIKP98] . we could hnd a lower bound on the spectral gap 
of the fast chain Y of leading order exp{—ci/AToglV}. This turns out to be too small for 
our purposes, cf. Remark 16.41 
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In the next lemma we construct a set of paths F that avoids more ‘bad’ vertices, which 
allows to improve the lower bound on the spectral gap to be polynomial in N. This is 
possible by using an embedding of EItv into its sub-graph of ‘good’ vertices, i.e. vertices 
with not too small Tx, which is inspired by similar embeddings in |HLN87] . 

For a nearest-neighbor path 7 = { xq , ..., we call the vertices Xi,... , Xn-i the inte¬ 
rior vertices of 7, and the edges {xj,Xj+i}, i = 1,... ,n — 2, the interior edges of 7. 

Lemma 3.2. There is an integer K > 0 and a constant Co > 0, such that P-a.s. for N 
large enough there exists a complete set of paths F, such that the following three properties 
hold. 

(i) For every path 7 G F, every interior edge e = {u,v} satisfies 

ZnCuv = ru a t^> 

( a ) I7I < 8N for all 7 G F. 

(Hi) Every edge e E £n is contained in at most N^2^~^ paths 7 G F. 

Proof. For Cq > 0, whose value will be hxed later, we say that x G Hat is good if Tx > 
and it is bad otherwise. To construct the complete set of paths F satisfying the 
required properties, we will use the fact that the set of good vertices is very dense in Hat. 
In particular, we will show that 

P-a.s. for N large enough, every x G EIjv has at least ^CoVN good neighbors, (3.2) 

and 

P-a.s. for N large enough, for any pair of vertices x, y at distance 2 or 3, there 

is a nearest-neighbor path of length at most 7 connecting x and y, such that (3.3) 

all interior vertices of this path are good. 

To prove these two claims, note hrst that for any x G ElAr, the probability of being bad 

is 


P[r^ < = ¥[Ex < -CoN-^ log AT] = 


-GoY“2 log V 




2 ds. 


For N large enough the integrand is larger than i, and it follows that 

P[x is bad] < ^(l - CoN~^ logiV) =: ^(1 - q^). 

Hence, the number of bad neighbors of a vertex x G EIjv is stochastically dominated by 
a Binomial(A^, |(1 — Q’at)) random variable B. For A > 0, the exponential Chebyshev 
inequality yields 

P|]x has more than N — -Cq^/N bad neighbors] 

< P[H > AT - ^Co^] = P[e^^ > 

N 


< e-^(^-hCoVN) ^ 1 j 

= + e-\l + g^)) 


N 


< 2"^V2 
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Since —)■ 0 as iV —)■ oo, the last term in the parenthesis is bounded by 2e ^ for N large 

enough. Inserting qjsf and choosing A = logiV, the above is bounded by 

2~^ exp Ilog N -CoVNlogN + 2^ 

< 2 -'^exp I - ic'o\/^logiv}, 

for N large enough. With a union bound over all x G Hat and using the Borel-Cantelli 
lemma, fl3.2p follows. 

To prove fl3.3p . we hrst introduce some notation. For a given vertex x and {zi,..., ik] C 
{1,..., iV}, denote by the vertex that differs from x exactly in coordinates ii,... ,ik- 

If two vertices x and y are at distance 2, then y = x^^ for some /c, I G {1, ... ,N}. Then 
for {i, j} n {k, /} = 0 we dehne the path Yjy of length 6 as {x, x\ = y^^, y^, y}. 

Similarly, for x,y with d{x,y) = 3, we have y = and for {i,j} fl {k,l,m} = 0 we 

dehne the path Yjy of length 7 by {x, x\ ^ yij^ yj^ yy observe that for 

hxed x,y with d{x,y) = 2 or 3 and for different pairs i,j the innermost 3 or 4 vertices of 
the paths YJy disjoint. 

We now show that with high probability, for every x, y at distance 2 or 3, we may hnd 
such that Yjy has only good interior vertices. Fix a pair x,y E Hat at distance 2 or 3, 
and let as above k, I or k, I, m be the coordinates in which x and y differ. Assume for the 
moment that both x and y have at least good neighbors. Then there are at least 

\ClN pairs i, j such that the vertices x* and y^ are good. Moreover, since it is a matter 
of dealing with a constant number of exceptions, we may tacitly assume that i ^ j, and 
{hj} Fl {k, /} = 0 or {i,j} n {k, I, m} = 0 , respectively. 

The remaining interior vertices {x*Ax*A^x*A^ = or x*A^ are 

all good with probability strictly larger than 1 / 2 , so the probability that one or more 
of these vertices are bad is bounded by 15/16. Since these 3 or 4 innermost vertices are 
disjoint for different pairs i,j, by independence, the probability that among all jCqN pairs 
{i,j} there is none for which all innermost 3 or 4 vertices of Yjy good is bounded by 
(15/16)Hence, for one hxed pair x,y E ElAr at distance 2 or 3, where both x and y 
have at least ^Coy/N good neighbors, the probability that there is no path from x to i/ of 
length 6 or 7 with all interior vertices good is bounded by 

(15/16)3^0^. 

There are less than 2^{N‘^ + N^) pairs of vertices at distance 2 or 3 respectively, and we 
know from the proof of 03.21) that with probability larger than 1 — g-cvdviog a ^ 

has at least ^CoVN good neighbors. It follows that the probability that the event in 03.31) 
does not happen is bounded by 

g-cvWiogA ^ 2 ^(iv 2 + Ar 3 )(i 5 /i 6 ) 3 ^o^. ( 3 . 4 ) 

Choosing Cq > \J xogitjw ^PPlyii^g the Borel-Cantelli lemma implies 03.3p . 

We now use the density properties 03.2p and 03.31) of good vertices to dehne a (random) 
mapping from the hypercube to its sub-graph of good vertices. Let 

Vn = {{a;o,... ,Xfc} : fc > 0, d(xi,Xi_i) = 1 V i = 1,..., fc} 

be the set of hnite nearest-neighbor paths on Hat, including paths of length zero, which 
are just single vertices. Dehne the mapping tpjq : {]HlAr,£^Af} —t {HatjT’a} in fhe following 
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way. For a; G Hat, let 

{ X, if X is good; 

x\ if X and x^, j < i, are bad but x* is good; 

X, if X is bad and has no good neighbor. 

By fl3.2p . P-a.s. for N large enough the last option will not be used, and therefore ipN maps 
all vertices to good vertices. In this case, for two neighboring vertices x, y, their good 
images (Pn{.x) and (pN^y) can either coincide, or be at distance 1, 2, or 3. 

For an edge e = {x,y} G let </?Ar(e) be 

• the ‘path’ {y9jv(3^)}, if Pn{x) is good and Pn{x) = 

• the path {pjq{x),pj^{y)}, if both Pn{x) and PN^y) are good and at distance 1 ; 

• the path ‘minimal’ i,j such that all vertices of this path are good, 

if both Pn{x) and PN^y) are good with distance 2 or 3 and such path exists; 

• the path {x, y} in any other case. 

From fl3.2p and 03.31) it follows that P-a.s. for N large enough the last option does not occur 
and pn maps all edges to paths that contain only good vertices. 

Finally, we extend to be a map that sends paths to paths. For 7 = {xq, ..., x„} G Vn 
we dehne (pNil) to be a concatenation of paths 0Ar({xj_i,Xj}), i = 1,... ,n, with possible 
loops erased by an arbitrary hxed loop-erasure algorithm. Note that p^ can make paths 
shorter or longer, but by construction, for any path 7 G Pat, 

|</^y( 7 )I < 7|7|- (3.5) 

We can now construct the random set of paths F that satishes the properties of the 
lemma. We hrst dehne a certain canonical set of paths F, and then use the mapping ^at 
to construct F from F. 

For any pair of vertices x 7 ^ 1 / G Hat, let be the path from x to y obtained by 
consequently hipping the disagreeing coordinates, starting at coordinate 1. These paths 
are all of length smaller or equal to N, and the set F = : x 7 ^ 1 / G Hat} has the 

property that any edge e is used by at most 2^“^ paths in F. Indeed, if e = {u,v}, then 
there is a unique i such that Ui ^ Vi. By construction, e G jxy if 

X • • • ) Xi—li '^ii ■ ■ ■ 1 IIv)) 

y = {vi,..., Vi-i, Vi, yi+i, ..., i/jv). 

It follows that a total of iV — 1 coordinates of x and y are unknown, and so the number of 
possible pairs x,y for paths ^xy through e is bounded by 2^“^ (cf. |DS911 Example 2.2]). 
For any pair x 7 ^ 1 / G Hat, let the path jxy in the set F be dehned by 

'(l>N{,%y), if X, y are good, 

{x} o if X is bad and y is good, 

xy — ^ 

(pNilxy) ° {y}-, if X is good and y is bad, 

^{x} O (pMilxy) o {y}, if X is good and y is bad, 
where ‘o’ denotes the path concatenation. 

It remains to check that this set of paths F indeed satishes the required properties. First, 
by construction, F is complete, that is every path '-yxy G F connects x with y and is nearest- 
neighbor and self-avoiding. Further, by construction oi pn and the properties fl3.2p and 
fl3.3p . P-a.s. for N large enough, all interior vertices of all 7 G F are good, i.e. (i) is satished. 
Moreover, by 03.51) and the construction of the paths 7 G F, the paths 7 G F have length 
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at most 7N + 2, hence (ii) is satisfied for N > 2. Finally, (pN deforms the paths 7 ^ T 
only locally, so that the number of paths in F passing through an edge e is bounded by 
the number of paths in F passing through the ball of radius 4 around e. But this number 
is bounded by 2 ^“^ times the number of edges in that ball, which is bounded by for 

some integer K > 0. This proves (iii) and thus finishes the proof of the lemma. □ 


We can now prove the spectral gap estimate. 


Proof of Proposition 1,9. il P-a.s. for every N large enough we can find a complete set of 
paths F such that (i), (ii) and (iii) of Lemma [3.21 and fl2.3p hold. By (ii), the expression in 
03.1 p over which the maximum is taken is bounded from above by 


8N 1 


(r^Al)(rj,Al). 

'rxyHu,v} 


(3.6) 


We distinguish three cases for the position of the edge {u,v} in a path '-)xy 

(1) If {u, v} is an interior edge of 'y^y, then TuAt^ is larger than by (i) of Lemma 

(2) If {u, v} is at the end of the path 'y^y, say ai u = x, and v is an interior vertex of 73 ,^,, 

then Pr A Ty is either larger than or it is equal to in which case it cancels 

with Tx A I- Indeed, if A Ty was smaller than and equal to Ty, then v would 

be a bad interior vertex of 7 , which contradicts (i) of Lemma [3.21 

(3) If jxy only consists of the single edge {x, y}, then Tx A Ty is either larger than 1, or the 
term Tx A Ty cancels with the smaller one of Te A 1 and Ty A 1. 

It follows that for every edge {u, n} the expression fl3.6p is bounded from above by 

SN 

iV^'^°#{paths through e}. 

Zn 

Since, by (iii) of Lemma l3^ the number of paths is bounded by N^2^~^, and, by fl2.3p . 

Zn P k2^, this completes the proof. □ 


In a next step we construct the mixing time Tmix of the fast chain Y. To this end, define 
the mixing scale 

tun = -iV-^+3+/^c'o_ ('37) 

K, 

Then Proposition 13.11 reads Xn P 2N‘^mf^. 

We assume that our probability space (hi, 7F, P) is rich enough so that there exist infin¬ 
itely many independent uniformly on [ 0 , 1 ] distributed random variables, independent of 
anything else. A randomized stopping time T is a positive random variable such that the 
event {T < t} depends only on {Yg '■ s < t}, the environment, and on the values of these 
additional random variables. 


Proposition 3.3. P-a.s. forN large enough, there exists a randomized stopping time Tmi x 
with values in {mjv, 2mN, 3mN, • • • } such that Tmix is a strong stationary time for Y, that 
is for any (possibly random) Yq G EItv, 


(%) = y] = Uy, 

(ii) for any k>l, PyjTmix > kmN] = 

(iii) Tmix and Frmix independent. 
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Proof. This construction follows closely |MM151 Proposition 3.1], with only minor adapta¬ 
tions. Dehne the following distances from stationarity, 

s{t) = min{s > 0 : Vx,?/ G Mn, Pf[Yt = i/j > (1 - s)z/(i/)}, 

d{t) = max \\Pf[Y, e ■]-P;[Y, G - jUTY, 

where || • \\tv denotes the total variation distance. Dehne the time 

T = inf{f > 0 : d{t) < e~^}. 

From |AF021 Lemmas 4.5, 4.6 and 4.23] we know that 

d{t) < 

s{2t) < 1 — (1 — d{t)Y, 


X 


Y 


T < — (1 + - log —] , 


(3.8) 


where u* = Since P[ra, < e ■^^] < ce by the Borel-Cantelli lemma, P-a.s. for 

N large enough, log ^ < N'^. Therefore, by Proposition 13.11 and 03.81) . P-a.s. for N large 
enough, T < |mAr, d{\m]^) < e“^, and < e~^, which means that for all Yq, y G ElAr, 

= I/] > (1 - e~^)^y 

We can now dehne the strong stationary time Tmix with values in {mAr, 2mAr,... }. Let 
Ui, U 2 ,... be i.i.d. uniformly on [0,1] distributed random variables, independent of any¬ 
thing else. Conditionally on Yq = x, Ymj.^ = y, let Tmix = mAr if 




£/.< 


(< 1 ). 


Px\Xini.i — y\ 

Otherwise, we dehne T^ix inductively: for every /c G N, conditionally on Tmix > 
Ykmi, = Z and Y(^k+i)mt, = V, let T^ix = {k + l)mN if 

(1 - 


= y] 

By construction, we have for every a: G El at. 


(< 1 ). 




P;[T^ix = rriN I Y^, = y] = ^ 


and thus 


Pi[ym^ = yy 

PyQ^Pmix V I ^0 (1 ^ )^S/* 


Similarly, we have 

PyQ [Pmix (.k p l)?TT.Ar, Y(^k+l)m^ 2/ | Tvciix P kni]\l, Ffcmjv ^ ')^y 

By induction over fc, we obtain that for any A: G N and y G Hat, 


= krrif^, Ykmj., = v] = e ^^(1 - e ^)z/, 


2 /’ 


which hnishes the proof. 


□ 


For future reference we collect here two useful statements that follow directly from the 
construction of Tnix. 
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Lemma 3.4. For every t > 0 and x G Mn and every starting distribution p, 


Pp\Yt = xlTinix <t]= 

\P^[Yt = x]- >t]= 


4. Bounds on mean hitting time and random scale 


In this section we prove bonnds on the mean hitting time E^lH^] of deep traps x G Pat. 
As a corollary of the proof we will obtain a useful bound on the Green function in deep 
traps. The bounds on the mean hitting times will further imply bounds on the random 
scale Rn, which will imply the claim fll.7p of Theorem 11.11 


Proposition 4.1. There exists 6 G (0,1/6), such that P-a.s. for N large enough, 

< El[H^] < 2^+^'"' for every xeVn. 

The proof of Proposition 14.11 is split in two parts. 

Proof of the upper hound. For the upper bound we use [AFn21 Lemma 3.17] which states 
that 

Since Tj, > 1 for deep traps x G Vni this is smaller than which by Proposition 13.11 and 
fl2.3p is bounded by 2^+^^ ^ P-a.s. for N large enough. □ 


For the lower bound we will use a version of Proposition 3.2 of [CTWllj which allows to 
bound the inverse of the mean hitting time El[p[x\ in terms of the effective conductance 
from X to a suitable set B. Recall the dehnition of the conductances Cxy from 02.41) . and 
let Cx = J2yr^x Following the terminology of |LP141 Chapter 2], we dehne the effective 
conductance between a vertex x and a set B as 


C{x ^B) = P-Ih: > Hb]c,. 


By Proposition lA.ll which is a generalization of [CTWll 
continuous-time hnite-state-space Markov chains, 


Proposition 3.2] to arbitrary 


1 

E^] 


<C{x B)n{B)-^. 


(4.1) 


To apply this bound effectively, we should hnd a set B such that C{x —)■ B) is small 
and ^{B) close to 1. In the next lemma we construct such sets B for every x G Hat. For 
these sets we have some control on the conductances connecting B and B'^. Using standard 
network reduction techniques we can then give a bound on the effective conductance C(x —)■ 
B), which when plugged into fl4.ip will imply the lower bound on 

Denote by B{x,r) = {y E Hat, d{x,y) < r} the ball of radius r around x, and by 
dB{x, r) = {y E Hat, d{x, y) = r} the sphere of radius r. 


Lemma 4.2. For every 5 G (0,1/6), P-a.s. for N large enough, there exist radii {px)xmN 
satisfying 1 < Px < , such that for all x E Hat and for all y E dB{x, px), Ty < 2 2 ^^ . 

Proof. Fix <5 E (0,1/6). We say that a sphere dB(x,r) is good if Ty < 22 ^^ for all 
y E dB{x, r), otherwise we say that it is bad. Using the Gaussian tail approximation fl2.2p . 
we get that 
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The size of the sphere dB{x,r) is bounded by N^, hence the probability that the sphere 
dB{x,r) is bad is bounded by 

N^¥[Ty > < cexp{r\ogN 

By independence of the r^, the probability that for one hxed x all the spheres dB{x,r), 
r = are bad is bounded by 

/ N 

r=l 

< exp log c + log N - }. 

8p 

Finally, by a union bound, the probability that among all 2^ vertices in El at there is one 
for which all spheres dB{x, r), r = 1, are bad is bounded by 

2N <exp{N^^\ogc + N^^\ogN + N\og2-^^^N^+^}. 

Since 5 < 1/6 this decays faster than exponentially, and so by the Borel-Cantelli lemma 
the event occurs P-a.s. only for hnitely many N, i.e. P-a.s. for N large enough we can hnd 
for every x G Hat a radius px < such that the sphere dB{x, px) is good. □ 

Proof of the lower bound of Proposition [/TT} For every x G T>n we dehne the set Ax = 
B{x, Px) if the radius px from Lemma IT^ exists. otherwise we take Ax = {x}. By Lemma 
and fl2.3p . P-a.s. for N large enough, for all x G Vj^ all conductances Cyz = {xy A Tz)/Zm 
connecting Ax and A^ are smaller than 22 ^^ /{k2^). 

By the parallel law (cf. |LP14[ Chapter 2.3]), the effective conductance between the 
boundaries of Ax and is equal to the sum of all the conductances of edges connecting 
Ax and A^, and so P-a.s. for N large enough, 

C(dAx ^ dA^x) = ^ Cyz < 

y&dAx 

z&dA% 

By Rayleigh’s monotonicity principle (cf. |LP141 Chapter 2.4]), comparing the effective 
conductances from x to before and after setting all the conductances inside Ax to 
inhnity, it follows that 

C(x ^ Al) < C{dAx dAl) < . 

Since <5 < 1/6 and px < , we have < 2^^^ ^ for N large enough, and thus, 

P-a.s. for N large enough, 

C(x ^ Al) < c2-^+^"~\ (4.2) 

Moreover, P-a.s. for N large enough, as Uy = {1 A Ty)/ZN < l/Z^, using fl2.3p again, 

n{Al) = 1 - u{Ax) > 1 - Z],^\Ax\ > 1 - c2-^N^"' 1. (4.3) 

Plugging fl4.2p and fl4.3p into 04.1 p and readjusting 6 to accommodate for constants easily 
yields the required lower bound El[Hx] > 2^“'^^ . This completes the proof. □ 

As a corollary we get a lower bound on (a:)] for the deep traps x G Vn- 
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Corollary 4.3. There are constants 6 G (0,1/6) and c> 0, such that P-a.s. for N large 
enough, for all x G V^, under Pf the local time of Y in x before leaving A^, 

stochastically dominates an exponential random variable with mean c2~^^ ^. In particular, 
P-a.s. for N large enough, 

G [&..(!)] > 

Proof. The local time at x before hitting is an exponential random variable with mean 
equal to 

E^l^^/visits to X before ■ El,[Jf\. 

The expected number of visits before leaving is Pf.[Hfi > = CxC{x A%)~^. 

The mean duration of one visit to x is El,[.Jf\ = ■ For the deep traps we have 

Tx > 1, therefore = Ylyr^x^^yl^^ = Z^Cx. It follows that the local time at x 

before hitting is in fact an exponential random variable with mean Zjs[^C{x 
Using the bounds fl4.2p and 02.31) . the claim follows easily. □ 


As a next consequence we give bounds on the random scale dehned in 02 .1Up . Note 
that this lemma also proves the statement 01 .7p about the asymptotic behavior of i?Ar in 
Theorem 11.11 


Lemma 4.4. For every e > 0, P-a.s. for N large enough, 

2F-e)N 

Proof. By Proposition 13.31 Tm^ ^/mN is a geometric random variable with parameter e“^, 
and thus ^ -Fx[Fmix“] < cm^ < e^^ by 03.71) . for every e > 0 and N large 

enough. Moreover, [Dn\ < c'2 F“'T'py f|2.8p . Using the lower bound on El[Hx] from 
Proposition 14.11 we obtain that for every e > 0, P-a.s. for N large enough, 





-1 


For the upper bound we need a lower bound on Recall the sets Ax con¬ 

structed in the proof of Proposition 14.11 and note that 

w“]. (4.4) 

By Corollary 14.31 P-a.s. for N large enough, the local time at x before hitting stochas¬ 
tically dominates an exponential random variable with mean c2~^ , hence 

Pf[lH,,{x) < 2-2^^-^ < 1 - < c2-^^-^ 

Moreover, for every e > 0, P-a.s. for N large enough. 
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Using the last two observations in fl4.4jl . P-a.s. for N large enough, 

> > 2-"'^“'] - p;[{<«,. (X) > 2 -™“'} n {T„,, < Ha,}]) 

> W a < Ha^S) 

> 1(1 - c'2-'^'-‘) - 2-") 

> 2 "^^. 

Combining this with |Pv| ^ by 02.81) and the upper bound on El[Hx] from 

Proposition 14.11 we obtain the required upper bound on i?Ar. □ 


5. Concentration of the local time functional 


In this section we prove the concentration of the local time functional that appears in 
the computation of the quasi-annealed Laplace transform of the clock process on the deep 
traps, as explained in the introduction (cf. fll.hp L We denote this functional by 

LM{t) = W(^)“ 

xGVjsr 

So far we had no restriction on the choice of 7 ' other than 1/2 < 7 ' < 7 , see fl 2 . 6 p . 
We now make an explicit choice as follows. Let £0 = | ((1 “ 7 ) A (7 — ^)), and dehne 
7' = 7 — such that in particular 

1-7 >2^0, (5.1) 

7 - 7 ' = Eq. (5.2) 

The main result of this section is the following proposition. 


Proposition 5.1. For every fixed t > 0, P-a.s. for N large enough, 


PJ. 


\LN{t)-t\ > 




Proof. We approximate Lj^it) by the sum of essentially independent random variables as 
follows. Let K = . For a hxed f > 0, dehne 




K 


-k, k = 0,..., K. 


pk 

^t,x 


Recall the notation fl2.5p . For every x G and k = 1,..., K, dehne = tk-i + H^ °K-i 
to be the time of the hrst visit to x after ffc-i, and set 

r{H^+N^mN)A(tk-N^mN) '' “ 

/ 'i-{Ys=x}ds 

' H^A{tk-2N2mN) 

The random variable gives ‘roughly’ the a-th power of the time that Y spends in x 
between tk-i and tk — with some suitable truncations. Let further 

= 2»-'A ^ 

xSVpf 

The next lemma, which we prove later, shows that the sum of the f/^(f)’s is a good 
approximation for 
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Lemma 5.2. For every t > 0, P-a.s. for N large enough, 

K 


PJ. 




k=l 


< c2 2 




With Lemma 15.21 the proof of the proposition reduces to understanding of the approx¬ 
imating sum We will compute its expectation and variance under Pf. In 

particular, we will show that there is c < oo such that for every f > 0 , 

K 




t 


k=l 


< c2 


-26oN 


and 


K 




k=l 


P-a.s. as oo, 


P-a.s. as iV —)■ oo. 


(6.3) 


(5.4) 


The statement of the proposition then follows from Lemma 15.21 fl5.3p and fl5.4p by routine 
application of the Chebyshev inequality. Indeed, P-a.s. for N large enough. 


P.T 


\LN{t)-t\ > 2-5"°^ 


< P.T 


K 




k=l 


+ Pf 


K 


r K 




k=l 


k=l 


>2-2-5 


-hoN 


>0—-rh^oY 


< c 2 ~ 2 ^°^ -F c ' 2 ~ w ^°^ < c"2-*^ 

which is the claim of the proposition. 

We proceed by computing the expectation fl5.3p . We will need two lemmas which we 
show later. The hrst lemma estimates the probability that a deep trap is visited by the 
process Y. 

Lemma 5.3. For every tjsf such that 1 < fv < 2^, for every e > 0, P-a.s. for N large 
enough, for all x G Pat, 

P:[H. < tN] = + 0 ( 422 (--d^) + 

The second lemma then gives the expected contribution of a single f'jj, to 'Yl!k=i ^v(^)- 

Lemma 5.4. For every fixed t > 0, k = 1,..., K and e > 0, P-a.s. for N large enough, 
for all X G V^, 

tR]^ 


Kifi] = 




KEllH, 

With Lemma [5^ it is easy to compute the expectation fl5.3p . Using that {VnI < c2‘d~'^''>^ 
by (ESD, and the dehnition 02.1 Op of i?Ar, for every e > 0, P-a.s. for N large enough, 

r A n K 


k=l 


YuY) =2'"'-’'''EEvii-M/4 

xeUjv k=l ^ ^ 


tR 


N 


■EUe, 


Q ^2^2-y+^e-2s^-2)N\ 


= t+ 0(2^^-^+^^-=°)^). 

Choosing e < Eq/S and recalling 05.ip implies 05.3p . 
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Next, we estimate the variance 05.41) . Since v is the stationary measure for Y, the random 
variables k = 1,... ,K, are identically distributed under PJ". Hence 

=-ffVar:(G(i))+2 Z Cov^f/tW. (/t W) ■ (5.5) 

^ k=l ' \<k<i<K 

The covariances can be neglected easily. Indeed, since by dehnition U%{t) depends on the 
trajectory of Y between times tk-i and tk — N’^rriN only, we can use the Markov property 
at the later time to write 


Co-^l (uUt),uk(t)) = E:[{ul,m - (5.6) 

By Lemma [231 . Using in addition that 

for some sufficiently large c', we see that the inner expectation satishes 


Inserting this inequality back to 05.6p and summing over k < j then implies that the second 
term in 05.51) is 0{e~'^^ ) and thus can be neglected when proving 05.41) . 

To control the variance of U\}{t) in 05.51) . it is enough to bound its second moment, which 
is 

Since, by dehnition, < N’^rriN and f'ta, 7 ^ 0 implies < tR^/K, 



. xeDjv 


H. < 


tR 


■N 


K 


E T 

x^yeVff 


H H < ^ 

Ex, Ey < ^ 


By Lemma [5.31 and Lemma [4.41 for every e > 0, P-a.s. as iV —)■ 00 , 

tR]\i 


P! 


Hx < 


K 




(5.7) 


(5.8) 


Moreover, by 02.81) . |PAr| < and by 03.7p . < 2^^, for every e > 0 and 

N large enough. It follows that the contribution of the hrst sum in 05.71) to the variance, 
including the prefactor K = 2^°^ from 05.5p . can be bounded by 

^ 2 ( 2 ( 7 '-7)+l-7'+7-l+2e)Y _ ^2(7'-7+2e)Y 


By 05.21) . 7 ' —7 + 26 < —eQ + 2e < —^Sq for e < £o/4, and hence this contribution is smaller 
than c2~^^°^ as required for 05.4p . 

For the second summation in 05.7p we write 


P^ 


tR.M 


tR]\f 


tR]si 

lA 

<p: 

Ex < Ey < 

K 

+ P^ 

' U 

Ey < Ex < 

K 
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By the Markov property, each of these two probabilities can be bounded by 

tRj\f 




Hx < Hy < 


tR]y 


p:[Hx e du]P^ 


'0 


tPN 

Hy < U 




< j p:ih, € du] (< T„ij + p, 

tPj^ 


H < 


< P^ 


Hx < 


K 


P^[Hy < T^ij + p; 


y - 


Using fl5.8p and fl2.8p again, the second sum in fl5.7p is bounded by 

^2(7-l+s-£0)Yj'22(l-y)N2(7-l+£-£0)Y^ ^ p;[hrj/ < (5.9) 

x^yeVN 

The hrst term in the parentheses of 05.91) together with the prefactors K from 05.51) and 
22 (7'-7)YjY4a^^ < from 05.7^ . contributes to the variance by at most 

g2fro+2(7'-7)+e+2(l-7')+2(7-l+e-eo))A _ ^2(3e-eo)Y ^ ^2“5SoN 

if £ is small enough, as required by 05.4p . 

For the second term in the parentheses of 05.9p we need the following lemma whose proof 
is again postponed. 

Lemma 5.5. Let Wf = J2yeVN y¥^x^{Hy<t}- Then for every e > 0, P-a.s. for N large 
enough, for every x G Pat, 

Using Lemma 15.51 and including all the prefactors as before, the contribution of the 
second term in 05.9p to the variance 05.51) is bounded by 

^2(eo+2(7'-7)+e+l-7'+7-l+£-£o+e)Y _ ^2(7'-7+3£)Y ^ c^-^sqN 

where for the last inequality we used 05.2p again, and choose e small enough. This completes 
the proof of 05.41) and thus of the proposition. □ 

We proceed by proving the lemmas used in the above proof. 


Proof of Lemma \5. ,91 By [AB921 Theorem 1] the hitting time Hx is approximately expo¬ 
nential in the sense that 


Pf[Hx>t]-e 


< 


1 


XYEf[HxY 

Hence, using Propositions 13.11 and 14.11 to bound Ay and Ef[Hx] respectively, we have for 
every e > 0, P-a.s. for N large enough, 

Pf[Hx < tN] = (1 - e-m) + 0 ( 2 (^-i)'^) 


tw 


El[Hx 




Finally, if 1 < tv < 2^ this is bounded by ctv2^^ which proves the lemma. 


Proof of Lemma \5.4\ By the strong Markov property and the dehnition of 

> p;[p. e 14-1,4 - 

6 14-1,4 - N^m„]]Ei[ty,„„(xr]. 


□ 


(5.10) 
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We will now give approximations of the expressions appearing in fl5.10j) . 

Observe that for every s, f > 0, 

itixr < Uxr + - e,{x)r. 

Using this ineqnality with t = and s = Tj^ix and applying the strong Markov 

property at T^ix, observing that is i^-distribnted, 

E:[l!„2^Jxr] < EllUr^Jxr] + E;[e„,^Jxr]. 

By Lemma 15731 using also that by fl3.7l) . (.N^mN^xY < < 2^^ for every e > 0 and 

N large enough, 

E:[iN2^^{xr] < P:[H. < 

Hence we obtain the upper bound 

e; [v™„(a^)1 < e; [iT^d^r] + (6.11) 

For a matching lower bound, note that 
But from Proposition 13.31 it follows that 

CX) 

k=N‘^ 

SO that 

El [iw^^JxT] > El - ce-'"'. (5.12) 

Combining 05.111) and 05.12p . we obtain 

E:[«a-wW“] =BdT.i.(2^)1 +0(2<='—)"). (5.13) 

Note also that by 03.7p . for every e > 0 and N large enough, 

B:I<t„,.(i)”] < < cm)i < 2'«. (5.14) 

To approximate the probabilities in 05.101) . we apply Lemma 15.31 for t at = tk-i and 
tN = tk — iN'^misf, for a hxed f > 0 and i = 1,2. Using Lemma 03] to bound Rn and 
Proposition 14.1! to bound E^Hx], for every e > 0, P-a.s. for N large enough, for both 
* = 1 , 2 , 

P:[Hx e [4-1,4 - tN^m^]] = + 0(22U+---o-bA^) = (5.15) 

Inserting both 05.15!) and 05.13!) in 05.101) . and using 05.14p . for every £ > 0, P-a.s. for N 
large enough. 


This proves the lemma. 

Proof of Lemma \5.2 . Note hrst that 


□ 
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To control the probability of this event, we introduce some more notation. Set = 0, 
= Hx, and for k >2 dehne the time of the ‘fc-th visit after mixing’ inductively as 

= inf{t > o ; Yt = x}. 

Let Afj^ = min{/c > 0, < t} be the number of ‘visits after mixing’ to x before time t. 

Finally, let Ik = [tk — 2,N‘^niN,tk]- Then 

K r K 




3x eV 


N ■ ItRj^ [X) 




k=l 


<p: 


Ys G Vn for some s G 


k=l 


(5.16) 


+ P:[3xeV^-. KRr.>^] 

+ PI [3a; G Vn ■ o Oh, > N'^itih] ■ 

We show that each of the three terms on the right-hand side is smaller than , which 

will prove the lemma. 

For the hrst term in fl5.16p . using the stationarity of z/ and the Markov property 

K n 


PI, 


(5.17) 


Yg eVh for some s G [H^<2N^mH\. 

k=l x^T>n 

By Lemma [5.31 P-a.s. for all x G Pn, for e > 0 small and N large enough, 

P:[H^ < 2N^mH] < 2^^-^^^. 

Since \Vh\ < c 2 d“T'')N f |2 sp . the right hand side of (15.17p is bounded by c 2 ^oN 2 i£-i')N^ 

Since 7 ' > 1/2 and by dehnition £0 < 1/4, when £ is small enough this is smaller than 
c 2 - 2 ^o^ as required. 

For the second term in (I5.16p . by Lemma [5.31 and the strong Markov property at T^ix, 
for every £ > 0, P-a.s. for N large enough, 

< tRn] < P:[H, < iRnf < . 

Together with (12. 8 p to bound \Pn\, and using (15.ip and (15.21) . P-a.s. for N large enough, 
P; [3a: G Pjv : MtRj, > 2] < c2^^-'i')n 


as required. 

Finally we give a bound on the third term in (I5.16p . By Proposition 13.31 PJ[Tmix > 
. Thus, with (12. 8 p to bound \T>n\, P-a.s. for N large enough, 

P; [3a: G Vh : o Oh, > N^mn] < c 2 ('-'^')Npr ^ ^ 2 ^^] 

< c'2~^^°^. 

Together with the previous estimates, this implies that the right-hand side of (I5.16P is 
bounded by , and concludes the proof of the lemma. □ 

Proof of Lemma 15.51 Let Rq = 0 and dehne recursively for i > 1 

n, = mf{t > : y* G \ {Yh,_,}}. 

By (12.9p . P-a.s. for N large enough, the vertices in Pjv are at least distance 5N from 
each other. In particular the balls = B{x,px), x G Pat, constructed in Lemma IT^ are 
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disjoint. Hence, when on y G Pat, the random walk Y should hrst leave Ay in order to visit 
VN\{y}. The strong Markov property and Corollary 14.31 then imply that Hi stochastically 
dominates a Gamma random variable with parameters i and /r := 

If Wf > i, then Hi < t. Hence, for t > fx, 

El[Wt] = > i\ < < i] < E 

i>l i>l i>l 

It follows that 

Y't...] S > Nha„] < < 2*'' 

by 02.81) ■ 03.71) and Proposition 13.31 This completes the proof. □ 

For later applications, we state two further consequences of the proof of Lemma 15.21 


Lemma 5.6. P-a.s. for N large enough, 

Pf\^x G Vn : < c2"^^°^, 

and 


PI. 


G • Hx Y t/^A'"} ^22 




< c2 


-YoN 


Proof. The hrst claim follows directly from the bounds on the second and third term on 
the right hand side of 05.16p in the proof of Lemma 15.21 since the local time in a vertex 
that is only ‘visited once after mixing’ is bounded by T^ix ° Oh,,,- 

The second assertion can be seen in the following way. Using Lemma [5.31 to bound the 
probability of a single vertex a; G PAf to be visited before time tR^ and fl2.8p to bound the 
size of Pn, for every e > 0, P-a.s. for N large enough, 

Ef[\{x G T>jv : Hx< < c2^'r-i'+e)N_ 

By fl5.2p this is equal to so choosing e < Sq/A this is smaller than c24^°'^. Then 

by the Markov inequality the probability that there are more than 2^^°^ vertices visited is 
smaller than c2“4^o^. □ 


6. Clock process of the deep traps 


This section contains the main steps leading to the proof of Theorem 11.11 Recall from 
fll.8p that the ‘clock process of deep traps’ S-d is given by 

Svit) = / {IV TYjl{YseVN}ds = / TY,l{Ys&VN}dS- 

Jo Jo 

We now show that Sd converges to a stable process. 


Proposition 6.1. Under the assumptions of Theorem M . A the rescaled clock processes of 
the deep traps gfj^ST>{tRN) converge in ^-probability as N ^ oo, in Pf distribution on the 
space Z1([0,T],M) eguipped with the Skorohod Mi-topology, to an a-stable subordinator Va- 

The proof of Proposition 16.11 consists of three steps. In a hrst step, we show convergence 
in distribution of one-dimensional marginals by showing that the Laplace transform of one¬ 
dimensional marginals converges. This step contains, to some extent, the principal insight 
of this paper and is split in two parts: We hrst show the quasi-annealed convergence 
mentioned in the introduction, which is then strengthened to convergence in probability 
with respect to the environment. The second and third step of the proof of Proposition 16.II 
are rather standard and deal with the joint convergence of increments and the tightness. 
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6.1. Quasi-annealed convergence. We establish here the connection between the Lap¬ 
lace transform of the clock process of deep traps and the local time functional studied 
in Section [5l The key observation is that the depths of the deep traps are in some sense 
independent of the fast chain F, and can be thus averaged out easily. 

To formalize this, we introduce a two-step procedure to sample the environment r. Let 
i = ('Ca;)xeHjv be i.i.d. Bernoulli random variables such that, cf. fl2.7p . 

= 1] = 1 - P[e, = 0] = P[x e V^] = 2-^'^{I + o(l)). 

Further, let E = (£'x)xeH^ be i.i.d. standard Gaussian random variables conditioned to be 
larger than and ^ = (^)xeHjv bi.d. standard Gaussian random variables condi¬ 
tioned to be smaller than The collections E and ^ are mutually independent. 

The Hamiltonian of the REM can be obtained by setting 

Ex = ( 6 . 1 ) 

From now on, we always assume that E^ are given by fl6.ip . Observe that in this procedure 
the set "Dat coincides with the set {x G Hjv : = !}• 

We use Q = to denote the cr-algebra generated by the ^’s and E’s. In particular, 

the number and positions of deep traps and ah the Ty, y ^ Vn, are ^-measurable. The 
depths of deep traps are however independent of Q. 

In the next lemma we compute the quasi-annealed Laplace transform of Sv- The term 
‘quasi-annealed’ refers to the fact that we average over the energies of the deep traps Ex 
(and over the law of the process), but we keep quenched the positions of the deep traps ^x 
and the energies of remaining traps 


Lemma 6.2. There is a constant K G (0, cxo) such that for every A > 0 and t > 0, 



A 

9N 


SvitRN) 





P-a.s. 


Proof. Recall the separation event ^ dehned in 02.91) . This event depends only on f and 
is therefore ^-measurable, and by Lemma 12.11 it occurs P-a.s. for N large enough. On 
no deep traps x G Pv are neighbors. Since moreover Xj, > 1 for x G "DAr, ah the transition 
rates 

Fr A Ty 

qxy^y = -r- -Ijz’, x,y e Mat, 

1 A 

are ^-measurable. That is, on the event the law of the chain Y is in fact ^-measurable. 
Therefore, on the order of taking expectations over the depth of the deep traps and 
the chain Y can be exchanged. Namely, denoting by E the expectation over the random 
variables Ex, on 


E 




= El 


= El 


= El 




exp 


9n Jo 
A 




XYs'^{Ys&VN}ds 


expi- itRN{x)Tx 

9n ^ 


( 6 . 2 ) 


We next approximate the inner expectation on the right-hand side of fl6.2p . Since its 
argument is bounded by one, it will be sufhcient to control it on an event of PjJ-probabihty 
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tending to 1 as TV —)■ cxo. Define the event 

A = {for all X G V^, itRj^{x) < iV^mTv} H ||LAr(t) — ( 6 . 3 ) 

By Proposition 15.11 and Lemma [5.61 P-a.s. for N large enough, P^[A'^] < . 

When performing the inner expectation of fl6.2p . the local times itR^{x) of Y as well 
as Vn are hxed, the expectation is taken only over the energies of the deep traps. By 
independence of the it follows that 


E 




n E 

a:eI5jv 


exp <j — —itRN{x)e 
9n 




= exp 


^ logE 
. xeDjv 


exp 


9n 


i.tRi,{x)e 




(6.4) 


For u G [0, let 


!?(«) = 1 — E 


exp 


-ue 




9n 


Since {EAj has standard Gaussian distribution conditioned on being larger than \ogg'j^, 
using that by fl2.7p . 

^ logS'Tvl = e Vn] = + 0 ( 1 )), 


P 


E^. > 




it follows that 


(yy' N 




Xu ^0 \/Ns 


e 2 11 — e SN 


ds. 


We use the substitution s = -^^(/T^^ + log —log A —logn). The lower limit of the integral 
then becomes 

^(logS'v - logS'v + logA + logu) =: u:{N). 

For u < N’^rriN, ojn is asymptotically dominated by logf^^y — \oggN < —ciV, and thus 
limAr^oo<^(^) = — 00 . After the substitution, 


07 ^ 

= ^^(1 + 0(1)) 

yj lE 


'uj{N) 




For u G [0, using the dehnition fll.bp of g^, the exponent of the hrst exponential 

satishes 

1 




{13z + log gN - log A - log uY 

= ~ ~ \og{aYV2EN) - log A - logu)^ 


( 6 . 6 ) 




-N + a log A + a logu + log(a/9\/27riV) — a^z + err( 2 ;) + o(l). 


Here, o(l) is an error independent of the variable z. Note that for the log^u part to be 
0 ( 1 ) it is important that itin dehned in 03.71) is not too large, see also Remark 16.41 The 
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second error term is 


erru = 


) = + ^^(^logM^/27riV) + logA + logM). 


Observe that limAr^oo err( 2 ;) = 0 for every 2 ; G M, and that for every e there is Nq large 
enough, so that for N > Nq and all 2 ; G M 


err( 2 ;) < e\z\ 


(6.7) 


Inserting the results of the computation 06.61) back into 06.Sp . using that 0^(3"^/2 = 7 log 2, 
we obtain 

/■oo 

d{u) = (1 - (1 + o(l)). ( 6 . 8 ) 

Jw[N) ^ ^ 

We now claim that 

f ^-af}z+err[z) O _ f A _ 

Juj{N) ^ Jm. ^ 


dz =: a 


(6.9) 


Indeed, the integrand converges point-wise on M to e — e which is integrable 

if a < 1. Moreover, by 06.7p . the integrand is bounded by which 

is integrable if we choose e < /3(1 — a) A a(3. The claim 06. 9 p follows by the dominated 
convergence theorem. 

We now come back to 06.4p . Since on M, £tRj^{x) < N^rrif^ for all x G "Pat, and 7 ' < 7 , 
we see that 'd{£tRj^{x)) = o(l) uniformly in a; G "Pat on A. With log(l — x) = —a;(l -|- 0{x)) 
as x —)■ 0 this yields 


E 




exp log (1 - 


xGVn 


= exp 


^{£tRj,{x)){l + O 

xSDjv 


The inner sum can be easily computed from 


Recalling that on A the local time 


functional L^iit) converges, denoting /C = ck/SC, we obtain on M, 

i9(W(^)) = £tR^{xr{l + o{l)) 

= a/3C'A“L7v(t)(l + o(l)) 

= /CA"t -I- 0 ( 1 ) as iV —)■ cx). 

It follows that on A 


( 6 . 10 ) 


E 


(Tm^AtRN) 


_ ( 1 + 0 ( 1 )) ^ ^-KtX 


= e 


-\- 0 ( 1 ) as 00 . 


Inserting this into 06.2p . using that = 0(e '^^), we conclude that, on JA, P-a.s. as 

TV —)■ cx. 


E 




G 


= El 


E 




U 


+ 0{e-'^^) = + 0 ( 1 ). 


Since occurs P-a.s. for N large enough, this completes the proof. 


□ 
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6.2. Quenched convergence. We strengthen the convergence in Lemma [6.21 in the fol¬ 
lowing way. 

Lemma 6.3. The one-dimensional marginals of the rescaled clock processes gfj^S-DitRjy) 
converge in ¥-probability as N ^ oo, in Pf-distribution to an a-stable law, that is for 
every t > 0 and A > 0, 


E 


T 

V 







in ¥-probability. 


Proof. It will be enough to show that P-a.s. for N large enough, 



ltST>{tRN) 



^ g-2^A“t 


+ 0 ( 1 ). 


( 6 . 11 ) 


Indeed, if fl6.1ip holds, then the conditional variance 


Var 




N^oo 


» 0, P-a.s., 


and the claim follows by an application of the Chebyshev inequality and Lemma 16.21 
To show fl6.1ip . we rewrite 


E 


r:[ 




T 


g 


= E 






where and are the local times of two independent Markov chains and both 
having law Pf, and Ef is the expectation with respect to the joint law Pf of these chains. 
Again P-a.s. for N large enough the separation event holds, and on this event the law 
Pf is ^-measurable. Therefore we can exchange the expectations similarly as before. As 
in Lemma 16.21 it will be enough to control the expression on an event of PJ'-probability 

tending to 1 as — ?■ cxd. We thus set A = fl where are dehned for both chains 

yW 

as in fl6.3l) . Applying Proposition 15.11 and Lemma [5.61 for both independent chains, we 
have that P-a.s. as N ^ oo, Pf[A'^] = 0{e~'^^). 

Let C be the event that Y^^l and Y^‘^1 visit disjoint sets of deep traps, 

C = |{a; e Pjv : > 0}n{x : 42^()r) > 0 } = 0| . 


We claim that Pf[C’^] = 0(e P-a.s. as iV —)■ oo. Indeed, by Lemmawith proba¬ 
bility larger than 1 — c2“4^o^, chain Y^^'> visits at most 2^^°^ different vertices in T>n- 

By Lemma 15.31 each of those vertices has probability smaller than c2^'y-^+^)^ of being hit 
by Y^‘^\ for every e > 0, P-a.s. for N large enough. Therefore by the choice fl5.ll) of eo, 
P-a.s. for N large enough, 

< c2"L^°^ -F c'2"^^°^+^^, 

which decays exponentially if e < ^o/2. 
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Since on C the of the vertices x G visited by and are independent, and 
since the integrand is bounded by 1, we have on the separation event 

2 


E 


El 

= El 

= k 
= k, 




e~k 

V- .( 1 ) 1 _ 

E 


■-AnC 


+ 0 (e-^'") 


e gNExeVj^kffi^Y- 


e 9^ 


■-Anc 


+ 0 (e-^"). 


Using the same procedure as in the proof of Lemma 16.21 on the event A, the two inner 
expectations, x G Ptv, both converge to 


exp 


KA“2'’'-")^ ^ I _ exp { - KA”!® (()}, *=1,2. 


xeDjv 




Moreover, on A, the local time functionals (t) concentrate on t simultaneously. It 
follows that on P-a.s. as N ^ oo, 

2 


E 


e: 




= e 


-2KX°‘t 


+ 0 ( 1 ). 


Noting again that occurs P-a.s. for N large enough, this shows flb.lip . and hence the 
lemma. □ 


Remark 6.4. (a) Inspecting the last proof carefully, it follows that Lemma 16.31 can be 
slightly strengthened. Namely, the stated convergence holds a.s. with respect to ^ and 
and in probability only with respect to E. The same remark then applies to Theorem ll.il 

(b) A closer analysis of the errors made in the computation of the quasi-annealed Laplace 
transform, in particular in 06.611 . shows that the error in Lemma 16.21 and 06.lip is of order 

log^ A^), where the logarithmic part comes from the log^u part in 06.611 . u being 
bounded by N'^rriM, and rriN being polynomial in N. Therefore the variance decay is not 
enough to apply the Borel-Cantelli lemma and obtain P-a.s. convergence. 

(c) Note also that the previous proof, more precisely bounding the log^u part of 06.6p . 

requires that log(A^^mAr) -C This is where our improved techniques to estimate the 

spectral gap in Proposition 13.11 are necessary. As we already remarked, the techniques of 
|FIKP98] show roughly that rriN < e^^NTogTv only, which is not sufficient. 

6.3. Joint convergence of increments. In the next step, we extend the convergence to 
joint convergence of increments. 

Lemma 6.5. The increments of the rescaled clock processes gkSr/itRN) converge jointly 
in F-probability in Pf-distribution to the increments of an a-stable subordinator. 


Proof. Fix k > 1 and 0 = to < ti <■■■< t^. We will show that for every Ai,..., Afc G 
(0, cxo) and P-a.e. environment r. 


lim E 

N^OO 


T 

V 


Yfi^l Ai(Sx>(URAr)-S-D(ti-iRAr)) 



2=1 




( 6 . 12 ) 
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Then the lemma follows by using the above proved convergence in P-probability in 
distribution of the one-dimensional marginals. 

Let P = \tiRN — tjPAr], For a set J C [0, cxo), let V(/) be the event 

V(/) = {n ^ Vn for all s el}. 

On the event V (uf^^P), for every i < k, 

Sx>(tiR]sf) — Sx>(ti-iRN) = SvitiRN — — Sx>(ti-iRN). (6.13) 

Moreover, by Lemma [5.31 P-a.s. for all x G Pat, for e > 0 small and N large enough, 

< N^rriM] < 

By 02.81) . IPa^I < c 2 d“')'')^, hence the expected number of vertices x G Pat visited in a 
time-interval of length is smaller than P-a.s. for N large enough. This 

still holds for a hnite union of intervals of length and so we conclude that by the 

Markov inequality, PJ" [V (ujL;^P)] -e 1, P-a.s. as iV —)■ oo. 

The reason to shorten the time intervals as above is to give the Markov chain Y the time 
it needs to mix. Dehne the event 


M = {Tmix o OuRn-n^tun < Vi = 1 ,..., fc}. 

It is easy to see using Proposition 13. 3l that P^ [Al] —)■ 1, P-a.s. as iV —)■ cx). On the event M. 
the Markov chain Y always mixes between tiRpj — N'^rnN and tiRpj and thus, by Lemma ITTI 
for every i = 1,..., /c and y G Hat, 


PI[YuRn =y\M] = Py. 


Therefore, on Al, 

{Sv{URp, - N^tun) - Sv{U.rRN))^^^_^ = (s^i\{U - U-i)Rn - N^^n))^ , 

””’(6.14) 

where the are the clock processes of the deep traps of independent stationary started 
processes having the same law as Y. 

Combining observations 06.131) and 06.141) . with the estimates on the probabilities of 
V (uf^^P) and Al, since the integrand is bounded by 1, we obtain that P-a.s. as A —)■ oo. 


-■A 52?=! ^i{ST>{tiRN) — S-D{U-lRN)) 


= e: 


= e: 




2 = 1 


{{U-ti-i)RN-N’^misi) 




+ 0 ( 1 ) 
+ 0 ( 1 ) 


2=1 




+ 0 ( 1 ). 


Using analogous arguments it can be shown that for every i = 1,..., /c, P-a.s. as A ^ cx. 


E 


T 

U 




El 


g L-l)Rjv) 


+ 0 ( 1 ). 


Combining the last two equations proves 06.12p and hence the lemma. 


□ 
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6.4. Tightness in the Skorohod topology. The last step in the proof of Proposition l6.ll 
is to show tightness. 

Lemma 6.6. The sequence of probability measures Pf[gf/^ST>{tRN) G • ] P-a.s. tight 
with respect to the Skorohod Mi-topology on Z1([0,T],M). 

Proof. The proof is standard but we include it for the sake of completeness. By |Whi02[ 
Theorem 12.12.3], the tightness in the Skorohod Mi-topology on Z1([0,T],M) is character¬ 
ized in the following way: For / G Z1([0,T],M), 6 > 0, t E [0,T], let 

Wf{6) = sup I inf \f(t) - (afiti) + (1 - a)f(t 2 ))\ : h < t < t 2 < T, t 2 - h < sX , 

[as [0,1] J 

Vf{t,6) = sup{|/(fi) - /(ta)! : ti,t 2 e [0,T] n (t - <5,t 5)} . 

The sequence of probability measures Pn = Pf[gfi^ST>{tRN) G •] on Z1([0,T],M) is tight 
in the Mi-topology, if 

(i) For every e > 0 there is c such that 

PN[f: ll/lloo > c] < £, iV>l. (6.15) 

(ii) For every e > 0 and rj > 0, there exist 6 G (0, T) and Nq such that 

RnU : vjf{6) >v]<e, N> No, (6.16) 

and 


PAf ■ Vf{0, 5)>r]]<e and Pat)/ : Vf{T, 6)>r]]<e, N > Nq. (6.17) 

Since the clock processes are increasing, fl6.15p is equivalent to convergence of the dis¬ 
tribution of gf/^S-DiTR^), which follows from the convergence of the Laplace transform of 
the marginal at time T. fl6.16p is immediate from the fact that the oscillating function 
Wf{6) is always zero since the processes gf/^ST/itR^) are increasing. To check fl6.17p . again 
by the monotonicity of the gf/^ST>(tR]\f) it is enough to check that for 6 small enough and 
N P Nq, Pf[gN^Sx>{5RN) > v] < £■ By the convergence of the marginal at time 5, we may 
take 6 such that P[14(5) > ?;] < | and Nq such that for N > Nq, 


pr 

u 

— Sd{SRn) > g 


_9n 


The reasoning for Vf{T, h) is similar. 


P[K(5) >h] 



□ 


7. Shallow traps 


In this section we show that the convergence of the clock process of the deep traps shown 
in Section [ 6 ] is enough for convergence of the clock process itself. 

Proposition 7.1. Under the assumptions of Theorem. \1.1[ the clock process of the deep 
traps approximates the clock process, namely, for every t > 0, 

— [S(tRN) — S-riitRN)) 0 P-a.s. in Pf -probability. 

Qn 

Proof. We will split the set of shallow traps iSAr:=ElAr\'DAr into two parts and separately 
deal with the corresponding contributions to the clock process. 

We start with ‘very shallow traps’. Let <5 > 0 be a small constant which will be hxed 
later and hjsf = ^. Dehne the set of very shallow traps as 

Sn = {x E Hat : < h^}. 
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The contribution of this set to the clock process can easily be neglected as follows. Write 


dN Jo 




(1 V 


9n 


^ (1 V Ta,)E;[itR^{x)] 


x£Sn 


Note that El[£tR^{x)] = vJ^Rn = A T^)tRN, and (1 V r^)(l A < hiy on S^. 

With fl2.3p for Z^f, and Lemma [4.41 for Rj^, for every e > 0, P-a.s. for N large enough, the 
right-hand side of the last equation can be bounded from above by 

To obtain exponential decay of this expression, it is enough to take account of the exponen- 

a^/3^ 

2 log 2 ’ 


tial part of gjsf, which is Then, up to sub-exponential factors, using that 7 = ^ 


the above is bounded by 

exp {((5 — l)a/3^ -|- + elog2)A^}. 

Since a < 1, by choosing e and 6 small enough this can be made smaller than for 
some c > 0. Applying the Markov inequality and the Borel-Cantelli lemma, 

P-a.s. in PJ-probability. (7.1) 


1 

— (iVryJl 
dN Jo 


{YsGSn}^^ 


To control the contribution of the remaining shallow traps Sn \ S^, we hrst split this 

1 


set into slices as follows. Set 


In = 


log 2 


(logs-w - loghTv) 


Note that by dehnition of g'^ and h^, for 6 small as hxed above. In = cN + 0(1) for some 
c > 0. For i = 1,..., Jtv, let 

S\^ = {x^Sn\Sn-. e [2-*^)v, 

so that Sn\Sn = 

We next control the sizes of the slices By the tail approximation fl 2 . 2 p . for all 

i = 1 ,..., In, 

1 


nv e 5^] < p 


Px > 


I3Vn 


Oogg'N - ilog2) 


= /«,' exp I - la'VW + a'i log 2 - /JJ] - o( 1 )} (1 + o( 1 )). 


(7.2) 


We separately control the two expressions and f^\. The hrst one equals 


f(i) _ 
J N,i ~ 


a'/3V2^ 


^{logg'^-i\og2) 


To control this, note that by dehnition oi In, for alH = 1,..., Jjv, 

log5')v — Oog2 > log/iAT — log2 = 6a(I^N — log2. 

It follows that, for alH = 1,..., /at, is bounded by some constant c > 0, which can be 
chosen to be independent of i. The second expression to control in fl7.2p is 


^( 2 ) _ log^ 2 ^ z log 2 
In.i ~ 


2P^N ^ a'P^N 


■ log(a'/3\/27rA^). 
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This is strictly positive, so it can be omitted in fl7.2j) in order to obtain an upper bound. 
Using the obtained control on and in fl7.2p . as well as the fact that 7' = fj^, we 
conclude that for alH = 1 ,..., /at, 

P[i/ e 5^] < c2-^'^2“'h 

In particular, the size \Slf\ of the i-th slice is dominated by a binomial random variable 
with parameters n = 2^ and p = c2“*2“'^^. Then it follows by the Markov inequality 
that for every e > 0, 

F[\Si^\ > < 2 ""^. 

Since In = cN + 0(1), a union bound and the Borel-Cantelli lemma imply that for every 
e > 0, P-a.s. for N large enough, 

\Sif \ < 2^^c 2“'‘2(^-^')^, for alH = 1,..., In- (7.3) 


Coming back to the contribution of the intermediate traps Sn \ Sn to the clock process, 
we use as before that El,[£tRj^{y)] = UytRN = ^^^tRN, and (1 V Ty){l A Ty) = Ty < 2~^^^g'j^ 
on Slf. With 02.31) for Zn, Lemma ITTI for Rn, and 07.31) for the size of S^j, we obtain that 
for every e > 0, P-a.s. for N large enough, for all i = 1,..., In, 


El 




9n Jo 


(1 V ryjl{y^g5i^}ds 


9n 


Ty)Ei;[itR^{y)] 


< 9N^\<SN\2~^^^g'^Z]lf^tRN 

^ 2(«-l)i2(7-7'+2e)Y 

~ 9n 


Summing over i = 1,... ,In, P-a.s. for N large enough. 


El 


9n Jo 


(1 Vryjl 




i , ds 


~ 9n 


(7.4) 


We claim that the right hand side of 07.4p decays exponentially in N for e > 0 small 
enough. To this end, as before, it is enough to take account of the exponential parts in 
both gN and g'^, which contribute to the right hand side of 07.41) by 


Hence, to show the exponential decay on the right hand side of 07.4p . it is sufficient to 
prove that we can choose e > 0 small enough, such that 

(\/Y-\/7)¥ + 7-7' + 2£<0. (7.5) 

Pc 

With a first order approximation of the concave function y/x at 7 , 

^(7-7') < a /7 - Vy. 

Since, ^ > || and a < 1, this implies 


0(7-7') < 
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and 07.51) thus holds for e > 0 small enough. The right hand side of 07.4p then decays 
exponentially, and with Markov inequality we conclude that 



This together with 07.ip hnishes the proof of the proposition. 


□ 


8. Conclusion 


Theorem 11.11 is a direct consequence of Propositions 16.1117.11 and Lemma 14.41 


Appendix A. Extremal characterization of mean hitting time 

In this appendix we give the proof of the formula 04.11) which gives a lower bound on the 
mean hitting time of a set when starting from s tationari ty. This formula is a continuous¬ 


time version of (a half of) Proposition 3.2 from CTWll . This proposition, as well as the 


underlying result |AFn21 Proposition 3.41], are stated for a continuous-time Markov chain 


whose waiting times are mean-one exponential random variables. We were not able to hnd 


analogous statements for general continuous-time Markov chains in the literature, so we 
provide short proofs here, for the sake of completeness. 

We start by introducing some notation. Let E be a reversible continuous-time Markov 
chain on a hnite state space S with transition rates q^y and invariant probability measure 
z/j,, denote by and the laws of Y started stationary and from x respectively, and by 
Py, Ex the corresponding expectations. Dehne the conductances as c^y = i^xQxy = J^yQyx- 
Let Qx = J2y Qxy and Cx = Yly The transition probability from x to i/ is pxy = £££ = 

In the same way as in Section [2l we dehne the hitting time Hx and the return time to 
x by E, and similarly Ha and H\ for sets A C S. 

A function on 5 is called harmonic in x, if J2y 9iy)Pxy = For x E S and 

B C S \ {x}, the equilibrium potential g* ^ is dehned as the unique function on S that is 


harmonic on (x U B)^, 1 on x and 0 on B. It is well known that 


= PylHr < Hb]. 


For a function g : S ^ the Dirichlet form is dehned as 



where y ^ z means that y and are neighbors in the sense that qzy > 0. 


The following proposition is the required generalization of Proposition 3.2 of CTWll 


Proposition A.l. For every x E S and B G S \ {x} 



To prove this proposition we will need a lemma which is a generalization of |AF02[ 
Proposition 3.41] giving the extremal characterization of the mean hitting time. 

Lemma A.2. For every x E S, 



g :S g{x) = 1, i^yg{y) = 0 




(A.3) 
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Proof. The proof follows the lines of |AFn2] with some minor changes to ht into the setting 
of general continuous-time chains. 

We first show that there is a minimizing function g that equals g{y) = where 


^yx 


Py[Yt = x]- dt. 


To this end, we introduce the Lagrange multiplier 7 and consider g as the minimizer of 
D{g,g) + i 'Yfz ^zg{z) with g{x) = 1. The contribution to this of giy) ioi y ^ x is 


T^ydyzigiv) - g{z)f + -fiZygiv), 


z^y 


which is minimized if 


2 - g{z)) + ITZy = 0. 


2~y 


From this we get for all 1/ G iS, by introducing the term including the parameter f3 for the 
case y = X, that 

giv) = ~ - -1" ~^{y-x}- 

Multiplying by qy and Vy., and summing over all y ^ S, 

EE ^ydyzgiv) = EE ^yQyzg^Z^ 




y z^y 


y z^y 


By reversibility VyQyz = ^z^zy, so the term on the left and the hrst term on the right are 
identical, which gives | Thus there is a minimizing g such that 


qy 


Hy 

Z'^y ^ 


(A.4) 


We now show that up to the factor P the function y i-A Zy^ satishes the same relation. 
Indeed, by the strong Markov property at the time Ji of the hrst jump of E, which under 
Py is an exponential random variable with mean -P, 


Qy ' 


rJl 


Zyx 


Qy 


{i{y=x}-i^x)dt+y2— f 

) + E 


z^y 


Qyz 


Zzx. 


z^y 


The function g[y) = thus satishes the constrains of the variational problem in flA.Sp 
and fulhlls flA.4p with P = l/Z^x- It is thus the minimizer of this variational problem. 

Moreover, by |AFn21 Lemmas 2.11 and 2.12], we have Z^x = Ey[PIx\i'x and VxEy[Hx\ = 
Zxx — Zyx- Denoting hiy) = Ey[Hx] and using these equalities, we obtain 

D(9.g)^-zrET;D{h,h)= ^ 


where for the last equality we used D{h,h) = E^[Hx], by e.g. |AB92l Lemma 6]. This 
completes the proof. □ 


With this lemma the proof of Proposition lA.ll follows the lines of CTWll 
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Proof of Proposition AA. 1[ To prove the inequality in flA.2jl . it is sufficient to modify the 
function g* ^ so that it becomes admissible for the variational problem in Lemma IA.21 
Write g* for g*^^ and dehne ^ on 5 as 

9*{z) - T.y&s^y9*{y) 


9{.z) = 


1 - ^y9*{y) 


Then g equals 1 on a; and Ylz&s ^z9{z) = 0. Hence, by Lemma IA.21 


E.[H^ 


< D{g, g) = D{g*, g*) [ 1 - ^y9*{y) ) 

V y&s J 


-2 


But g* is non-negative, bounded by 1 and non-zero only on B'^, therefore ^y9*{y) ^ 

the hrst part of Proposition lA.ll follows. 

To prove the equality in flA.2p . we show that 


D{<b,31b) = Prl«: > «b]c,. (A,5) 

Indeed, let again g* = g*^- 9* is harmonic in z, the second sum in the Dirichlet form 

dOD is 

^^y(9*iz) - 9*iy))^ = ^^y(9*{yf - 9*{zf)- 

y^z 

This shows that the contribution to the Dirichlet form of every edge that connects two 
vertices in which g* is harmonic or zero vanishes. Therefore D{g*,g*) reduces to 

0(9*.s') = i ('£ 3^,0 - 3*{v)? + £c„(l - 9*(!/)'5 ) 

y'^x / 

~ ^ ^ Cxy(l — g (y)) 

y'^x 

^ ^j Pz:yPy\Hx ^ 

yB^x 

= c,Px[H+ > Hb]. 

This proves flA.5D and thus the proposition. □ 
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